Thermostable DNA Polymerase from Carboxydothermus hydrogenoformans

The present invention relates to a thermostable enzyme which is a DNA polymerase 
obtainable from Carboxydothermus hydrogenoformans. 

Furthermore, the present invention relates to the field of molecular biology and provides
improved methods for the replication and amplification of deoxyribonucleic acid (DNA) and
ribonucleic acid (RNA) sequences. In a preferred embodiment, the invention provides a
method for synthesizing a complementary DNA copy from an RNA template with a
thermoreactive DNA polymerase. In another aspect, the invention provides methods for
amplifying a DNA segment from an RNA or DNA template using a thermostable DNA
polymerase (RT-PCR or PCR).

Heat stable DNA polymerases (EC 2.7.7.7, DNA nucleotidyltransferase, DNA-directed)
have been isolated from numerous thermophilic organisms (for example: Kaledin et al.
(1980) Biokimiya 45, 644-651; Kaledin et al. (1981) Biokimiya 46, 1247-1254; Kaledin
et al. (1982) Biokimiya 47, 1515-1521; Ruttimann et al. (1985) Eur. J. Biochem. 149,
41-46; Neuner et al. (1990) Arch. Microbiol. 153, 205-207).

For some organisms, the polymerase gene has been cloned and expressed (Lawyer et al.
(1989) J. Biol. Chem. 264, 6427-6437; Engelke et al. (1990) Anal. Biochem. 191,
396-400; Lundberg et al. (1991) Gene 108, 1-6; Perler et al. (1992) Proc. Natl. Acad. Sci.
USA 89, 5577).

Thermophilic DNA polymerases are increasingly becoming important tools for use in
molecular biology, and there is growing interest in finding new polymerases which have
more suitable properties and activities for use in diagnostic detection of RNA and DNA,
gene cloning and DNA sequencing. At present, the thermophilic DNA polymerases
mostly used for these purposes are from Thermus species, like Taq polymerase from
T. aquaticus (Brock et al. (1969) J. Bacteriol. 98, 289-297).

Reverse transcription is commonly performed with viral reverse transcriptases like the
enzymes isolated from Avian myeloblastosis virus or Moloney murine leukemia virus,
which are active in the presence of magnesium ions but have two disadvantages: they
possess RNase H activity, which destroys the template RNA during the reverse
transcription reaction, and they have a temperature optimum of 42°C or 37°C, respectively.

Alternative methods are described using the reverse transcriptase activity of DNA
polymerases of thermophilic organisms which are active at higher temperatures. Reverse
transcription at higher temperatures is advantageous for overcoming secondary structures
of the RNA template which could result in premature termination of products.
Thermostable DNA polymerases with reverse transcriptase activity are commonly
isolated from Thermus species. These DNA polymerases, however, show reverse
transcriptase activity only in the presence of manganese ions. These reaction conditions
are suboptimal, because in the presence of manganese ions the polymerase copies the
template RNA with low fidelity.

Another feature of the commonly used reverse transcriptases is that they do not contain
3'-5' exonuclease activity. Therefore, misincorporated nucleotides cannot be removed,
and thus the cDNA copies from the template RNA may contain a significant number of
mutations.

Therefore, it is desirable to develop a reverse transcriptase 

• which acts at higher temperatures to overcome secondary structures in the template to 
avoid premature termination of the reaction and to assure the production of cDNA 
without deletions 

• which is active in the presence of Magnesium ions in order to prepare cDNA from 
RNA templates with higher fidelity and 
• which has 3'-5'-exonuclease activity in order to remove misincorporated nucleotides
before continuation of DNA synthesis and to produce a product with a low mutation rate.



The present invention addresses these needs and provides a heat stable DNA polymerase
active at higher temperatures which has reverse transcriptase activity in the presence of
magnesium ions and which has 3'-5' exonuclease activity.

It is an object of this invention to provide a polymerase enzyme (EC 2.7.7.7)
characterised in that it has reverse transcriptase activity in the presence of magnesium
ions as well as in the presence of manganese ions. In another aspect the invention
comprises a DNA polymerase isolated from Carboxydothermus hydrogenoformans (Deutsche
Sammlung von Mikroorganismen und Zellkulturen GmbH, Mascheroder Weg 1b,
D-38124 Braunschweig, DSM No. 8979). In a further aspect the invention comprises a
DNA polymerase having reverse transcriptase activity in the presence of magnesium
ions and in the substantial absence of manganese ions. In a further aspect the invention
comprises a DNA polymerase having a molecular mass of about 100 to 105 kDa as
determined by in situ PAGE analysis. In a further aspect the invention comprises a
reverse transcriptase which is thermostable. In a further aspect the invention comprises a
DNA polymerase having 3'-5'-exonuclease activity. In a further aspect the invention
comprises a recombinant DNA sequence that encodes DNA polymerase activity of the
microorganism Carboxydothermus hydrogenoformans. In a related aspect, the DNA
sequence is depicted as SEQ ID No. 7. In a second related aspect the invention
comprises a recombinant DNA sequence that encodes essentially amino acid residues 1 to
831. In a further aspect the invention comprises a recombinant DNA plasmid that
comprises the DNA sequence of the invention inserted into plasmid vectors and which
can be used to drive the expression of the thermostable DNA polymerase of
Carboxydothermus hydrogenoformans in a host cell transformed with the plasmid. In a further
aspect the invention includes a recombinant strain comprising the vector pDS56
carrying the Carboxydothermus hydrogenoformans DNA polymerase gene and
designated pAR4. The E. coli strain (BL21 (DE3) pUBS520) carrying the plasmid pAR4
was deposited at the Deutsche Sammlung von Mikroorganismen und Zellkulturen
GmbH (Mascheroder Weg 1b, D-38124 Braunschweig, DSM No. 11179) and is
designated AR96.

In referring to a peptide chain as being comprised of a series of amino acids
"substantially or effectively" in accordance with a list offering no alternatives within itself, we
include within that reference any versions of the peptide chain bearing substitutions
made to one or more amino acids in such a way that the overall structure and the overall
function of the protein composed of that peptide chain is substantially the same as - or
undetectably different from - that of the unsubstituted version. For example, it is generally
possible to exchange alanine and valine without greatly changing the properties of the
protein, especially if the changed site or sites are at positions not critical to the
morphology of the folded protein.

The DNA polymerase is "thermostable", meaning that it is stable to heat and
preferentially active at higher temperatures, especially the high temperatures used for
denaturation of DNA strands. More particularly, the thermostable DNA polymerases are not
substantially inactivated at the high temperatures used in polymerase chain reactions.

The term "reverse transcriptase" describes a class of polymerases characterized as
RNA-dependent DNA polymerases. All known reverse transcriptases require a primer to
synthesize a DNA transcript from an RNA template. Historically, reverse transcriptase 
has been used primarily to transcribe mRNA into cDNA which can then be cloned into a 
vector for further manipulation. 

Other definitions are used in a manner consistent with the art.

Carboxydothermus hydrogenoformans was isolated from a hot spring in Kamchatka by
V. Svetlichny. A sample of C. hydrogenoformans was deposited at the Deutsche
Sammlung von Mikroorganismen und Zellkulturen GmbH (DSM) under the terms of
the Budapest Treaty and received Accession Number DSM 8979. The thermostable
polymerase isolated from Carboxydothermus hydrogenoformans has a molecular weight
of 100 to 105 kDa and retains more than 60 % of its initial activity after heating to 95°C
for 30 minutes. The thermostable enzyme possesses a 5'-3' polymerase activity, a
3'-5'-exonuclease activity, a 5'-3'-exonuclease activity and a reverse transcriptase activity
which is Mg2+-dependent. The polymerase according to the present invention has
reverse transcriptase activity in the presence of magnesium ions and in the substantial
absence of manganese ions. The thermostable enzyme may be native or recombinant
and may be used for first- and second-strand cDNA synthesis, in cDNA cloning, DNA
sequencing, DNA labeling and DNA amplification.

For recovering the native protein, C. hydrogenoformans may be grown using any suitable
technique, such as the technique described by Svetlichny et al. (1991) System. Appl.
Microbiol. 14, 205-208. After cell growth, one preferred method for isolation and
purification of the enzyme uses the following multi-step process:

The cells are thawed, suspended in buffer A (40 mM Tris-HCl, pH 7.5, 0.1 mM EDTA,
7 mM 2-mercaptoethanol, 0.4 M NaCl, 10 mM Pefabloc) and lysed by twofold passage
through a Gaulin homogenizer. The raw extract is cleared by centrifugation, the
supernatant dialyzed against buffer B (40 mM Tris-HCl, pH 7.5, 0.1 mM EDTA, 7 mM
2-mercaptoethanol, 10 % glycerol) and brought onto a column filled with
Heparin-Sepharose (Pharmacia). In each case the columns are equilibrated with the starting
solvent and, after application of the sample, washed with threefold its volume of this
solvent. Elution of the first column is performed with a linear gradient of 0 to 0.5 M
NaCl in buffer B. The fractions showing polymerase activity are pooled and ammonium
sulfate is added to a final concentration of 20 %. This solution is applied to a
hydrophobic column containing Butyl-TSK-Toyopearl (TosoHaas). This time the column is
eluted with a falling gradient of 20 to 0 % ammonium sulfate. The pool containing the
activity is dialyzed and again transferred to a column, this time with DEAE-Sepharose
(Pharmacia), and eluted with a linear gradient of 0 to 0.5 M NaCl in buffer B. The fourth
column contains Tris-Acryl-Blue (Biosepra) and is eluted as in the preceding case.
Finally the active fractions are dialyzed against buffer C (20 mM Tris-HCl, pH 7.5,
0.1 mM EDTA, 7.0 mM 2-mercaptoethanol, 100 mM NaCl, 50 % glycerol).
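
The four-column scheme above can be written down as a small data structure, which makes the order of resins and the gradient end points easy to check. The following Python sketch is illustrative only: the class and function names are hypothetical, and it assumes that the falling ammonium sulfate gradient is linear, which the protocol does not state explicitly.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GradientStep:
    resin: str     # column material named in the protocol
    start: float   # eluent concentration at the start of the gradient
    end: float     # eluent concentration at the end of the gradient
    unit: str


# Column order and gradient end points as given in the protocol above.
PURIFICATION_STEPS = [
    GradientStep("Heparin-Sepharose", 0.0, 0.5, "M NaCl"),
    GradientStep("Butyl-TSK-Toyopearl", 20.0, 0.0, "% ammonium sulfate"),
    GradientStep("DEAE-Sepharose", 0.0, 0.5, "M NaCl"),
    GradientStep("Tris-Acryl-Blue", 0.0, 0.5, "M NaCl"),
]


def gradient_concentration(step: GradientStep, fraction: float) -> float:
    """Eluent concentration after `fraction` (0..1) of a linear gradient."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must lie between 0 and 1")
    return step.start + (step.end - step.start) * fraction


# Print the eluent concentration halfway through each gradient.
for step in PURIFICATION_STEPS:
    midpoint = gradient_concentration(step, 0.5)
    print(f"{step.resin}: {midpoint:g} {step.unit} at the gradient midpoint")
```

The same helper covers rising and falling gradients, since only the sign of the difference between the end points changes.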

Isolation of recombinant DNA polymerase from Carboxydothermus hydrogenoformans
may be performed with the same protocol or with other commonly used procedures.

DNA polymerase activity was measured by incorporation of digoxigenin-labeled dUTP
into the synthesized DNA and detection and quantification of the incorporated
digoxigenin essentially according to the method described in Höltke, H.-J.; Sagner, G.; Kessler,
C. and Schmitz, G. (1992) Biotechniques 12, 104-113.

Determination of reverse transcriptase activity is performed essentially as described for
determination of DNA polymerase activity, except that the reaction mixture consists of
the components described in Example 3.

In situ PAGE analysis of polymerase activity and reverse transcriptase activity was
performed essentially according to the method described by Spanos A. and Hübscher U.
((1983) Methods in Enzymology 91, 263-277). Some minor but essential modifications
to the original method are that the renaturation of the SDS-denatured polypeptides is
performed in the presence of magnesium ions (3 mM) and dATP (0.5-1 µM) to assist
refolding.

3'-5' exonuclease activity is commonly referred to as the "proofreading" or "editing"
activity of DNA polymerases. It is located in the small domain of the large fragment of Type A
polymerases. This activity removes mispaired nucleotides from the 3' end of the primer
terminus of DNA in the absence of nucleoside triphosphates (Kornberg A. and Baker
T.A. (1992) DNA Replication, W. H. Freeman & Company, New York). This nuclease
action is suppressed by deoxynucleoside triphosphates if they match the template and
can be incorporated into the polymer.

The 3'-5' exonuclease activity of the claimed DNA polymerase can be measured as
degradation or shortening of a 5'-digoxigenin-labeled oligonucleotide annealed to
template DNA in the absence or presence of deoxyribonucleoside triphosphates or on
DNA fragments in the absence or presence of deoxyribonucleoside triphosphates.

Carboxydothermus hydrogenoformans DNA polymerase is the first DNA polymerase
isolated from thermophilic eubacteria with a higher activity in the presence of
magnesium ions than in the presence of manganese ions, as shown in figure 1. Compared to
the DNA polymerase activity, the reverse transcriptase activity in the presence of
magnesium is relatively high. This is shown in figure 6 in comparison with DNA polymerases
from T. filiformis, A. thermophilum and T. thermophilus, the latter being the DNA
polymerase most commonly used for reverse transcription. The magnesium-dependent
reverse transcriptase activity is advantageous since DNA polymerases synthesize DNA
with higher fidelity in the presence of magnesium than in the presence of manganese
(Beckmann R. A. et al. (1985) Biochemistry 24, 5810-5817; Ricchetti M. and Buc H.
(1993) EMBO J. 12, 387-396). Low fidelity DNA synthesis is likely to lead to mutated
copies of the original template. In addition, Mn2+ ions have been implicated in an
increased rate of RNA degradation, particularly at higher temperatures, and this can cause
the synthesis of shortened products in the reverse transcription reaction.

The DNA sequence (SEQ ID No.: 7) of Carboxydothermus hydrogenoformans
polymerase and the derived amino acid sequence of the enzyme are shown in figure 5. The
molecular weight deduced from the sequence is 94 348 Da. In SDS polyacrylamide gel
electrophoresis, however, the Carboxydothermus hydrogenoformans polymerase has an
electrophoretic mobility higher than E. coli pol I (109 kDa) and a lower mobility than
Taq polymerase (94 kDa) and Klenow fragment (76 kDa), as shown in figure 2.
Comparing the migration properties of Taq and E. coli DNA polymerases with those of
Carboxydothermus hydrogenoformans polymerase, a molecular weight of 100 to 105 kDa can
be deduced. Since the Carboxydothermus hydrogenoformans polymerase isolated from
the native strain has the same migration properties as the recombinant enzyme, the
"slower" migration during SDS gel electrophoresis must be a property of the
enzyme rather than a cloning artefact. A possible explanation for this phenomenon could be that
this enzyme, which is derived from a thermophilic organism, has a very stable structure
which is not completely unfolded under the standard denaturation conditions used.
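
An apparent molecular weight of this kind is usually read off a calibration line of log10(molecular weight) against relative mobility of the reference proteins. The Python sketch below illustrates that calculation under stated assumptions: the molecular weights of the three standards are taken from the text, whereas the relative mobilities (Rf values) are hypothetical numbers invented for illustration and are not measurements from figure 2.

```python
import math


def fit_log_mw(standards):
    """Least-squares fit of log10(MW) = intercept + slope * Rf.

    `standards` is a list of (relative_mobility, molecular_weight_kDa) pairs.
    """
    xs = [rf for rf, _ in standards]
    ys = [math.log10(mw) for _, mw in standards]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("standards need at least two distinct mobilities")
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def apparent_mw(standards, rf):
    """Apparent molecular weight (kDa) of a band with relative mobility `rf`."""
    intercept, slope = fit_log_mw(standards)
    return 10 ** (intercept + slope * rf)


# Molecular weights from the text; Rf values are HYPOTHETICAL placeholders.
STANDARDS = [
    (0.30, 109.0),  # E. coli pol I
    (0.38, 94.0),   # Taq polymerase
    (0.50, 76.0),   # Klenow fragment
]

# A band migrating between E. coli pol I and Taq polymerase (hypothetical Rf).
print(round(apparent_mw(STANDARDS, 0.33), 1), "kDa (illustrative only)")
```

With real gel data, the Rf values would be measured from the same gel as the sample, since the calibration is only valid for one run.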

The production of a recombinant form of Carboxydothermus hydrogenoformans DNA
polymerase generally includes the following steps: chromosomal DNA from
Carboxydothermus hydrogenoformans is isolated by treating the cells with detergent, e.g. SDS,
and a proteinase, e.g. Proteinase K. The solution is extracted with phenol and chloroform
and the DNA purified by precipitation with ethanol. The DNA is dissolved in Tris/
EDTA buffer and the gene encoding the DNA polymerase is specifically amplified by
the PCR technique using two mixed oligonucleotides (primer 1 and 2). These
oligonucleotides, described by SEQ ID No.: 1 and SEQ ID No.: 2, were designed on the basis
of conserved regions of family A DNA polymerases as published by Braithwaite D. K.
and Ito J., 1993, Nucl. Acids Res. Vol. 21, p. 787-802. The specifically amplified
fragment is ligated into a vector, preferably the pCR™II vector (Invitrogen), and the
sequence is determined by cycle-sequencing. Complete isolation of the coding region
and the flanking sequences of the DNA polymerase gene can be performed by restriction
fragmentation of the Carboxydothermus hydrogenoformans DNA with another
restriction enzyme as in the first round of screening and by inverse PCR (Innis et al.,
(1990) PCR Protocols; Academic Press, Inc., p. 219-227). This can be accomplished
with synthesized oligonucleotide primers binding at the outer DNA sequences of the
gene part but in opposite orientation. These oligonucleotides, described by SEQ ID Nos.
3 and 4, were designed on the basis of the sequences which were determined by
sequencing of the first PCR product described above. As template, Carboxydothermus
hydrogenoformans DNA is used which is cleaved by restriction digestion and
circularized by contacting with T4 DNA ligase. To isolate the coding region of the whole
polymerase gene, another PCR is performed using primers as shown in SEQ ID Nos. 5 and 6
to amplify the complete DNA polymerase gene directly from genomic DNA and
to introduce ends compatible with the linearized expression vector.
SEQ ID No. 1:

Primer 1: 5'-CCN AAY YTN CAR AAY ATH-3'

SEQ ID No. 2:

Primer 2: 5'-YTC RTC RTG NAC YTG-3'

SEQ ID No. 3:

Primer 3: 5'-GGG CGA AGA CGC TAT ATT CCT GAG C-3'

SEQ ID No. 4:

Primer 4: 5'-GAA GCC TTA ATT CAA TCT GGG AAT AAT C-3'

SEQ ID No. 5:

Primer 5: 5'-CGA ATT CAA TCC ATG GGA AAA GTA GTC CTG GTG GAT-3'

SEQ ID No. 6:

Primer 6: 5'-CGA ATT CAA GGA TCC TTA CTT CGC TTC ATA CCA GTT-3'
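
Two properties of these primers can be checked mechanically: how many distinct sequences each "mixed" (degenerate) primer represents, and which common restriction sites are present in primers 5 and 6. The Python sketch below is an illustration, not part of the described method. It uses the standard IUPAC ambiguity codes and the well-known recognition sequences of EcoRI, NcoI and BamHI; the choice of these three enzymes is an assumption made for the example, as the text only states that the primers introduce ends compatible with the expression vector.

```python
# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Recognition sequences of three common enzymes (assumed to be of interest).
RESTRICTION_SITES = {"EcoRI": "GAATTC", "NcoI": "CCATGG", "BamHI": "GGATCC"}


def normalize(primer: str) -> str:
    """Remove the spaces that group the primer into triplets."""
    return primer.replace(" ", "").upper()


def degeneracy(primer: str) -> int:
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for base in normalize(primer):
        count *= len(IUPAC[base])
    return count


def sites_in(primer: str) -> list:
    """Names of the listed restriction sites that occur in the primer."""
    seq = normalize(primer)
    return [name for name, site in RESTRICTION_SITES.items() if site in seq]


PRIMER_1 = "CCN AAY YTN CAR AAY ATH"
PRIMER_2 = "YTC RTC RTG NAC YTG"
PRIMER_5 = "CGA ATT CAA TCC ATG GGA AAA GTA GTC CTG GTG GAT"
PRIMER_6 = "CGA ATT CAA GGA TCC TTA CTT CGC TTC ATA CCA GTT"

print(degeneracy(PRIMER_1), degeneracy(PRIMER_2))
print(sites_in(PRIMER_5), sites_in(PRIMER_6))
```

Primer 1 expands to 768 sequences and primer 2 to 64, which is the sense in which they are "mixed oligonucleotides".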

The gene is operably linked to appropriate control sequences for expression in either
prokaryotic or eucaryotic host/vector systems. The vector preferably encodes all
functions required for transformation and maintenance in a suitable host, and may encode
selectable markers and/or control sequences for polymerase expression. Active
recombinant thermostable polymerase can be produced by transformed host cultures either
continuously or after induction of expression. Active thermostable polymerase can be
recovered either from host cells or from the culture media if the protein is secreted
through the cell membrane.

It is also preferable that Carboxydothermus hydrogenoformans thermostable polymerase
expression is tightly controlled in E. coli during cloning and expression. Vectors useful
in practicing the present invention should provide varying degrees of controlled
expression of Carboxydothermus hydrogenoformans polymerase by providing some or all of
the following control features: (1) promoters or sites of initiation of transcription, either
directly adjacent to the start of the polymerase gene or as fusion proteins, (2) operators
which could be used to turn gene expression on or off, (3) ribosome binding sites for
improved translation, and (4) transcription or translation termination sites for improved
stability. Appropriate vectors used in cloning and expression of Carboxydothermus
hydrogenoformans polymerase include, for example, phage and plasmids. Examples of
phage include lambda gt11 (Promega), lambda Dash (Stratagene) and lambda ZapII
(Stratagene). Examples of plasmids include pBR322, pBTac2 (Boehringer Mannheim),
pBluescript (Stratagene), pSP73 (Promega), pET3A (Rosenberg, supra, (1987) Gene
56:125-135), pASK75 (Biometra), pDS56 (Stüber, D., Matile, H. and Garotta G. (1990)
Immunological Methods, Lefkovits, I. and Pernis, B., eds.) and pET11C (Studier, F. W.
(1990) Methods in Enzymology, 185:60-89). According to the present invention the use
of a plasmid, particularly pDS56, has been shown to be advantageous. The plasmid pDS56
carrying the Carboxydothermus hydrogenoformans DNA polymerase gene is then
designated pAR4.

Standard protocols exist for transformation, phage infection and cell culture (Maniatis
et al. (1982) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory
Press). Of the numerous E. coli strains which can be used for plasmid transformation, the
preferred strains include JM110 (ATCC 47013), LE392 pUBS520 (Maniatis et al.,
supra; Brinkmann et al., (1989) Gene 85:109-114), JM101 (ATCC No. 33876), XL1
(Stratagene), RR1 (ATCC No. 31343), BL21 (DE3) pUBS520 (Brinkmann, U. et al.
(1989) Gene 85, 109-114) and BL21 (DE3) pLysS (Studier, F. W. et al., (1990) Methods
in Enzymology, supra). According to the present invention the use of the E. coli strain
BL21 (DE3) pUBS520 has been shown to be advantageous. The E. coli strain BL21 (DE3)
pUBS520 transformed with the plasmid pAR4 is then designated AR96 (DSM No.
11179). E. coli strains XL1-Blue (Stratagene) and ER1458 (Raleigh, E. A. et al. (1988)
Nucleic Acids Research 16:1563-1575) are among the strains that can be used for
lambda phage, and Y1089 can be used for lambda gt11 lysogeny. The transformed cells
are preferably grown at 37°C and expression of the cloned gene is induced with IPTG.

Isolation of the recombinant DNA polymerase can be performed by standard techniques.
Separation and purification of the DNA polymerase from the E. coli extract can be
performed by standard methods. These methods include, for example, methods utilizing
solubility such as salt precipitation and solvent precipitation, methods utilizing the
difference in molecular weight such as dialysis, ultra-filtration, gel-filtration, and
SDS-polyacrylamide gel electrophoresis, methods utilizing a difference in electric charge
such as ion-exchange column chromatography, methods utilizing specific interaction
such as affinity chromatography, methods utilizing a difference in hydrophobicity such
as reversed-phase high performance liquid chromatography and methods utilizing a
difference in isoelectric point such as isoelectric focussing electrophoresis.

The present invention provides improved methods for efficiently transcribing RNA and
amplifying RNA or DNA. These improvements are achieved by the discovery and
application of previously unknown properties of thermoactive DNA polymerases.

The thermostable enzyme of this invention may be used for any purpose in which such
enzyme activity is necessary or desired. In a particularly preferred embodiment, the
enzyme catalyzes the nucleic acid amplification reaction known as PCR. This process for
amplifying nucleic acid sequences is disclosed and claimed in U.S. Patent No.
4,683,202. The PCR nucleic acid amplification method involves amplifying at least one
specific nucleic acid sequence contained in a nucleic acid or a mixture of nucleic acids
and produces double-stranded DNA. Any nucleic acid sequence, in purified or
nonpurified form, can be utilized as the starting nucleic acid(s), provided it contains or is
suspected to contain the specific nucleic acid sequence desired. The nucleic acid to be
amplified can be obtained from any source, for example, from plasmids such as
pBR322, from cloned DNA or RNA, from natural DNA or RNA from any source,
including bacteria, yeast, viruses, organelles, and higher organisms such as plants and
animals, or from preparations of nucleic acids made in vitro.

DNA or RNA may be extracted from blood, tissue material such as chorionic villi, or 
amniotic cells by a variety of techniques. See, e.g., Maniatis et al., 1982, Molecular 
Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory, Cold Spring Harbor, 
New York) pp. 280-281. Thus the process may employ, for example, DNA or RNA, 
including messenger RNA, which DNA or RNA may be single-stranded or double- 
stranded. In addition, a DNA-RNA hybrid which contains one strand of each may be 
utilized. 



The amplification of target sequences in DNA or from RNA may be performed to prove
the presence of a particular sequence in the sample of nucleic acid to be analyzed or to
clone a specific gene. DNA polymerase from Carboxydothermus hydrogenoformans is
very useful for these processes. Due to its 3'-5' exonuclease activity it is able to
synthesize products with higher accuracy than the reverse transcriptases of the state of the art.

DNA polymerase from Carboxydothermus hydrogenoformans may also be used to
simplify and improve methods for detection of RNA target molecules in a sample. In
these methods DNA polymerase from Carboxydothermus hydrogenoformans may
catalyze: (a) reverse transcription, (b) second strand cDNA synthesis, and, if desired,
(c) amplification by PCR. The use of DNA polymerase from Carboxydothermus
hydrogenoformans in the described methods would eliminate the previous requirement of two
sets of incubation conditions which are necessary due to the use of different enzymes for
each step. DNA polymerase from Carboxydothermus hydrogenoformans may
be used to perform RNA reverse transcription and amplification of the resulting
complementary DNA with enhanced specificity and with fewer steps than previous RNA
cloning and diagnostic methods.
Brief description of the drawings

Fig. 1 shows the relative reverse transcriptase activity of DNA polymerase from
Carboxydothermus hydrogenoformans in dependence of magnesium and manganese salt.

Fig. 2 shows a photograph of a DNA polymerase assay performed in situ. The activity of
DNA polymerase from Carboxydothermus hydrogenoformans and reference
polymerases is detected in situ. A fraction of DNA polymerase from C. hydrogenoformans
and reference enzymes were submitted to electrophoresis on an SDS-polyacrylamide gel
containing activated calf thymus DNA. After electrophoresis the SDS was removed, the
proteins were renatured and incubated at 65°C in the presence of magnesium salt,
dNTPs and digoxigenin-labeled dUTP to allow DNA synthesis. The nucleic acid was
blotted to a nylon membrane and the newly synthesized DNA detected by a
chemiluminescence reaction.

Fig. 3 shows the thermostability of DNA polymerase from Carboxydothermus
hydrogenoformans. Aliquots of the DNA polymerase were incubated for 30 min. at the
temperatures indicated in the figure, and subsequently the remaining enzyme activity was
determined.

Fig. 4 shows the analysis for 3'-5'-exonuclease activity of DNA polymerase from
Carboxydothermus hydrogenoformans in comparison with DNA polymerase from Thermus
aquaticus and Pyrococcus woeseii. The analysis is performed in the presence or absence
of dNTPs. A 22mer primer labeled with digoxigenin at the 5'-end was annealed to a
34mer template DNA leaving a 12 bp 5' overhang of template DNA. DNA polymerases
from Carboxydothermus hydrogenoformans, Thermus aquaticus and Pyrococcus
woeseii were incubated with this substrate in the presence of magnesium with or without
dNTPs. The products were separated on a sequencing gel, blotted to a nylon membrane
and detected by a chemiluminescence reaction.

Fig. 5 shows the DNA sequence of the polymerase gene of Carboxydothermus
hydrogenoformans with SEQ ID No.: 7 and the derived peptide sequence of the DNA
polymerase protein with SEQ ID No. 8.

Fig. 6 shows a comparison of the reverse transcriptase activity of the thermostable DNA
polymerases from Carboxydothermus hydrogenoformans, Anaerocellum thermophilum,
Thermus filiformis (Pacific Enzymes) and Thermus thermophilus. Similar amounts (units)
of the DNA polymerases were analyzed. Each enzyme was tested for DNA polymerase
activity, for reverse transcriptase activity in the presence of Mg2+ (5 mM) and for reverse
transcriptase activity in the presence of Mn2+ (1 mM) under the reaction conditions optimal
for the individual enzymes. DNA synthesis was measured by incorporation of
digoxigenin-labeled nucleotides. In order to compare the ratio of DNA polymerase to reverse
transcriptase activity, the relative light units (RLU) measured in the DNA polymerase
assay were set to 100. The RLUs measured in the reverse transcriptase activity tests are
expressed as percent of the polymerase activity.
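
The normalization used for Fig. 6 is a simple ratio, sketched below in Python for one enzyme. The function names are hypothetical and the RLU readings are invented placeholders, not data from the figure.

```python
def percent_of_polymerase_activity(rlu_polymerase: float, rlu_rt: float) -> float:
    """Express a reverse transcriptase reading as percent of the polymerase reading."""
    if rlu_polymerase <= 0:
        raise ValueError("polymerase RLU must be positive")
    return 100.0 * rlu_rt / rlu_polymerase


def normalize_enzyme(readings: dict) -> dict:
    """Scale all readings of one enzyme so that the polymerase assay equals 100."""
    reference = readings["polymerase"]
    return {
        assay: percent_of_polymerase_activity(reference, rlu)
        for assay, rlu in readings.items()
    }


# HYPOTHETICAL raw RLU values for a single enzyme.
example = {"polymerase": 20000.0, "rt_mg": 5000.0, "rt_mn": 8000.0}
print(normalize_enzyme(example))
```

Because each enzyme is scaled to its own polymerase reading, the resulting percentages compare the ratio of activities between enzymes rather than their absolute signal.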



Example 1 

Detection of endonuclease, exonuclease and ribonuclease activities:

Absence of endonuclease activity: 1 µg of plasmid DNA is incubated for 4 hours with
an excess of purified DNA polymerase in 50 µl of test buffer with a paraffin oil overlay
at 72°C.

Absence of nonspecific exonuclease activity: 1 µg of EcoRI/HindIII fragments of
lambda DNA are incubated in 100 µl of test buffer in the absence and presence of
dNTPs (1 mM final concentration each) with an excess of purified DNA polymerase for
4 hours at 72°C.

Absence of ribonuclease activity: 3 µg of MS2 RNA are incubated with an excess of
DNA polymerase in 20 µl of test buffer for 4 hours at 72°C. The RNA is subsequently
analyzed by electrophoresis in a MOPS gel (Maniatis et al. (1982) Molecular Cloning:
A Laboratory Manual, Cold Spring Harbor, New York).

Example 2

Determination of DNA polymerase activity 

DNA polymerase activity was measured by incorporation of digoxigenin-labeled dUTP
into the synthesized DNA and detection and quantification of the incorporated
digoxigenin essentially according to the method described in Höltke, H.-J.; Sagner, G.; Kessler,
C. and Schmitz, G. (1992) Biotechniques 12, 104-113. The reaction is performed in a
reaction volume of 50 µl containing 1 or 2 µl of diluted (0.05 U - 0.01 U) DNA
polymerase and 50 mM Tris-HCl, pH 8.5; 12.5 mM (NH4)2SO4; 10 mM KCl; 5 mM
MgCl2; 10 mM 2-mercaptoethanol; 33 µM dNTPs; 200 µg/ml BSA; 12 µg of DNAse
I-activated DNA from calf thymus and 0.036 µM digoxigenin-dUTP.
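
Setting up such a reaction from stock solutions follows the usual dilution rule, stock volume = final concentration x total volume / stock concentration. The Python sketch below applies it to the buffer components of the 50 µl reaction. The final concentrations are those given above; the stock concentrations are hypothetical and chosen only to make the example concrete.

```python
REACTION_VOLUME_UL = 50.0

# Final concentrations in mM, taken from the assay description above.
FINAL_MM = {
    "Tris-HCl, pH 8.5": 50.0,
    "(NH4)2SO4": 12.5,
    "KCl": 10.0,
    "MgCl2": 5.0,
    "2-mercaptoethanol": 10.0,
}

# Stock concentrations in mM: HYPOTHETICAL values, not from the source.
STOCK_MM = {
    "Tris-HCl, pH 8.5": 1000.0,
    "(NH4)2SO4": 500.0,
    "KCl": 1000.0,
    "MgCl2": 100.0,
    "2-mercaptoethanol": 1000.0,
}


def stock_volume_ul(final_mm: float, stock_mm: float, total_ul: float) -> float:
    """Volume of stock (µl) needed to reach `final_mm` in `total_ul`."""
    if stock_mm <= 0 or final_mm > stock_mm:
        raise ValueError("stock must be positive and at least as concentrated as the target")
    return final_mm * total_ul / stock_mm


def pipetting_plan(final_mm: dict, stock_mm: dict, total_ul: float) -> dict:
    """Stock volume (µl) for every component of the reaction buffer."""
    return {
        name: stock_volume_ul(conc, stock_mm[name], total_ul)
        for name, conc in final_mm.items()
    }


plan = pipetting_plan(FINAL_MM, STOCK_MM, REACTION_VOLUME_UL)
for name, volume in plan.items():
    print(f"{name}: {volume:.2f} µl")
```

The remaining volume is made up by enzyme, template DNA, nucleotides, BSA and water, which are left out of this sketch.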

The samples are incubated for 30 min. at 72°C, the reaction is stopped by addition of
2 µl 0.5 M EDTA, and the tubes are placed on ice. After addition of 8 µl 5 M NaCl and
150 µl of ethanol (precooled to -20°C) the DNA is precipitated by incubation for
15 min. on ice and pelleted by centrifugation for 10 min. at 13 000 rpm and 4°C. The
pellet is washed with 100 µl of 70 % ethanol (precooled to -20°C) and 0.2 M NaCl,
centrifuged again and dried under vacuum.
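
The concentrations present during the precipitation step follow directly from the volumes given above. The short Python calculation below is a sketch under two assumptions that the text does not state: the ethanol added is absolute (taken as 100 %), and volumes are additive.

```python
def final_concentration(stock: float, added_ul: float, total_ul: float) -> float:
    """Concentration after diluting `added_ul` of `stock` into `total_ul`."""
    return stock * added_ul / total_ul


# Volumes in µl, as given in the protocol above.
reaction_ul = 50.0
edta_ul = 2.0
nacl_ul = 8.0
ethanol_ul = 150.0
total_ul = reaction_ul + edta_ul + nacl_ul + ethanol_ul

nacl_molar = final_concentration(5.0, nacl_ul, total_ul)             # 5 M NaCl stock
edta_millimolar = final_concentration(500.0, edta_ul, total_ul)      # 0.5 M EDTA stock
ethanol_percent = final_concentration(100.0, ethanol_ul, total_ul)   # ASSUMED absolute ethanol

print(f"total volume: {total_ul:.0f} µl")
print(f"NaCl: {nacl_molar:.2f} M, EDTA: {edta_millimolar:.1f} mM, ethanol: {ethanol_percent:.0f} %")
```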

The pellets are dissolved in 50 µl Tris-EDTA (10 mM/0.1 mM; pH 7.5). 5 µl of the
sample are spotted into a well of a nylon membrane bottomed white microwell plate
(Pall Filtrationstechnik GmbH, Dreieich, FRG, product no: SM045BWP). The DNA is
fixed to the membrane by baking for 10 min. at 70°C. The DNA loaded wells are filled
with 100 µl of 0.45 µm-filtered 1 % blocking solution (100 mM maleic acid, 150 mM
NaCl, 1 % (w/v) casein, pH 7.5). All following incubation steps are done at room
temperature. After incubation for 2 min. the solution is sucked through the membrane with
a suitable vacuum manifold at -0.4 bar. After repeating the washing step, the wells are
filled with 100 µl of a 1:10 000 dilution of Anti-digoxigenin-AP, Fab fragments
(Boehringer Mannheim, FRG, no: 1093274) diluted in the above blocking solution. After
incubation for 2 min. and suction, this step is repeated once. The wells are washed twice
under vacuum with 200 µl each time of washing-buffer 1 (100 mM maleic acid, 150 mM
NaCl, 0.3 % (v/v) Tween™ 20 (poly(oxyethylene)n-sorbitan monolaurate), pH 7.5). After
washing another two times under vacuum with 200 µl each time of washing-buffer 2
(10 mM Tris-HCl, 100 mM NaCl, 50 mM MgCl2, pH 9.5) the wells are incubated for
5 min. with 50 µl of CSPD™ (Boehringer Mannheim, no: 1655884), diluted 1:100 in
washing-buffer 2, which serves as a chemiluminescent substrate for the alkaline
phosphatase. The solution is sucked through the membrane and after 10 min. incubation the
RLU/s (Relative Light Units per second) are detected in a luminometer, e.g. MicroLumat
LB 96 P (EG&G Berthold, Wildbad, FRG).

With a serial dilution of Taq DNA polymerase a reference curve is prepared from which
the linear range serves as a standard for the activity determination of the DNA
polymerase to be analyzed.
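
The reference-curve step can be sketched numerically. The following is a minimal illustration only, assuming a straight-line fit over the linear range; the Taq dilution values, the RLU/s readings and the function names are hypothetical and are not taken from the assay described above.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def units_from_signal(rlu_per_s, slope, intercept, lowest, highest):
    """Convert a luminometer reading to units; refuse to extrapolate."""
    if not lowest <= rlu_per_s <= highest:
        raise ValueError("signal lies outside the linear range of the reference curve")
    return (rlu_per_s - intercept) / slope


# Hypothetical Taq reference points: units per reaction and measured RLU/s.
taq_units = [0.01, 0.02, 0.03, 0.04, 0.05]
taq_rlu = [1000.0, 2000.0, 3000.0, 4000.0, 5000.0]

slope, intercept = fit_line(taq_units, taq_rlu)
sample_units = units_from_signal(2500.0, slope, intercept, min(taq_rlu), max(taq_rlu))
```

Readings outside the range covered by the reference dilutions are rejected rather than extrapolated, which mirrors the restriction to the linear range of the curve.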

Example 3

Determination of reverse transcriptase activity 

The assay is performed essentially as described for determination of DNA polymerase
activity except that the reaction mixture consists of the following components: 1 µg of
poly(dA)-(dT)15, 33 µM of dTTP, 0.36 µM of digoxigenin-dUTP, 200 µg/ml BSA,
10 mM Tris-HCl, pH 8.5, 20 mM KCl, 5 mM MgCl2, 10 mM DTE and various amounts
of DNA polymerase. The incubation temperature used is 50°C.



Example 4

Detection of DNA polymerase activity in situ 

In situ PAGE analysis of polymerase activity and reverse transcriptase activity was
performed essentially according to the method described by Spanos A. and Hubscher U.
(1983) Methods in Enzymology 91, 263-277. A minor but essential modification of the
original method is that the renaturation of the SDS-denatured polypeptides is
performed in the presence of magnesium ions (3 mM) and dATP (0.5-1 µM) to assist
refolding.

In brief the method is as follows: 

After separation of the polypeptides from either crude cell extracts or purified samples
on denaturing 8 % polyacrylamide gels (stacking gel 5 % acrylamide) which contain
150 µg activated calf thymus DNA per ml gel volume, the gels are washed four times
for 30 min each at room temperature with moderate shaking in excess renaturation
buffer (Tris-HCl, 50 mM, pH 8.3; EDTA, 1 mM; 2-mercaptoethanol, 3 mM; KCl,
50 mM; glycerol, 5-10 %) to remove SDS. Then the gels are incubated overnight in the
same buffer, including 3 mM MgCl2 and 0.5-1 µM dATP, at 4°C without agitation. The
first four washes are repeated the next day with renaturation buffer.

Subsequent to the removal of SDS and renaturation of the proteins the gel is transferred
into reaction mixture, consisting of Tris-HCl, 50 mM, pH 8.3; KCl, 50 mM; DTT, 3 mM;
MgCl2, 7 mM; 12 µM of each dATP, dCTP, dGTP, 8 µM dTTP and 4 µM Dig-dUTP;
10 % (vol/vol) glycerol. Gels were first incubated under shaking at room temperature
for one hour and then warmed up stepwise to 37°C, 45°C, 55°C, 65°C and 72°C.
At each incubation temperature DNA synthesis was allowed to proceed for 60 minutes.

After DNA synthesis, the DNA is transferred either by contact blotting or by capillary
blotting (15 x SSC, Maniatis et al., supra) to nylon membranes (Boehringer Mannheim
GmbH) and crosslinked.
The detection of newly synthesized, digoxigenin-labeled DNA followed the procedure
given in the previous section (Determination of DNA polymerase activity).

For molecular weight determination, marker polymerases of known molecular weight
(e.g. Klenow polymerase, Pol I, Taq polymerase, Tth polymerase, HIV RT, M-MuLV
RT) are applied to different lanes of the same gel.

The molecular weight of the claimed DNA polymerase according to this method is 100
to 105 kDa.
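
Reading a molecular weight off marker lanes is commonly done by fitting the logarithm of the marker masses against their relative migration. The sketch below assumes that standard semi-logarithmic relationship; the marker masses and migration values are hypothetical placeholders, not measurements from this example, and the function name is illustrative.

```python
import math


def estimate_mass_kda(markers, sample_migration):
    """Fit log10(mass) against relative migration and read off the sample mass."""
    xs = [migration for _, migration in markers]
    ys = [math.log10(mass) for mass, _ in markers]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    return 10 ** (slope * sample_migration + intercept)


# Hypothetical markers: (mass in kDa, relative migration between 0 and 1).
markers = [(50.0, 0.8), (100.0, 0.5), (200.0, 0.2)]
estimate = estimate_mass_kda(markers, 0.5)
```

With real gels the fit is only reliable inside the mass range spanned by the markers, so a sample band should be bracketed by at least one lighter and one heavier marker.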

Example 5

Detection of 3'-5' exonuclease activity

3'-5' exonuclease activity is commonly referred to as the "proofreading" or "editing"
activity of DNA polymerases. It is located in the small domain of the large fragment of
Type A polymerases. This activity removes nucleotides from the 3' end of the primer
terminus of DNA in the absence of nucleoside triphosphates (Kornberg A. and Baker T.A.
(1992) DNA Replication, W. H. Freeman & Company, New York). This nuclease action is
suppressed by deoxynucleoside triphosphates if they match the template and can be
incorporated into the polymer.

The 3'-5' exonuclease activity of the claimed DNA polymerase can be measured as
degradation or shortening of a 5'-digoxigenin-labeled oligonucleotide annealed to
template DNA in the absence or presence of deoxyribonucleoside triphosphates or on DNA
fragments in the absence or presence of deoxyribonucleoside triphosphates.

Degradation of digoxigenin-labeled oligonucleotide: The reaction mixture is essentially
the same as that for determination of DNA polymerase activity (50 mM Tris-HCl,
pH 8.4; 12.5 mM (NH4)2SO4; 10 mM KCl; 5 mM MgCl2; 10 mM 2-mercaptoethanol),
except that the dNTP concentration was reduced to 12.5 µM and activated calf thymus
DNA was replaced by 500 fmol primer or template/primer mixture.

The primer sequence is:
SEQ ID NO. 8:

Dig-GCATGGATCCCACTGCCCAGGG (5' to 3'). This primer is annealed with
template molecules of various 12 bp 5' overhangs. DNA polymerase samples of
typically 0.1 units are incubated in a total reaction volume of 10 µl for 30 min at 72°C in
a Perkin Elmer thermal cycler. Reactions are stopped by adding an equal volume of
formamide buffer (98 % formamide; 10 mM EDTA; bromophenol blue and xylene cyanol)
and denatured by heating for 10 min at 95°C. Samples are quickly chilled on ice and
loaded on a 20 % denaturing polyacrylamide/urea sequencing gel. Electrophoresis is
performed at 60°C and 2000 V for 2.5 hours.

After separation, DNA is transferred onto a positively charged nylon membrane
(Boehringer Mannheim) by contact blotting for 30 min. The DNA is crosslinked to the
membrane by UV irradiation with 120 mJoule (Stratalinker, Stratagene). The membrane
is blocked with blocking solution (100 mM maleic acid, 150 mM NaCl, 1 % (w/v)
casein; pH is adjusted to 7.5 with 1 M NaOH) at room temperature for at least 30 min.

Digoxigenin-labeled primer DNA is detected with anti-digoxigenin-AP, Fab fragments
(Boehringer Mannheim, FRG, no 1093274) diluted 1:10000 in blocking solution
(30 min at room temperature). Excess unbound antibody is removed by washing 3-4
times (10-15 min each step) with washing buffer (100 mM maleic acid, 150 mM NaCl,
0.3 % (v/v) Tween™ 20 (poly(oxyethylene)n-sorbitan monolaurate), pH 7.5). The
membrane is transferred into a buffer containing 10 mM Tris-HCl and 100 mM NaCl, pH 9.5,
and washed twice for an additional 10-15 min at room temperature. Finally the membrane
is soaked with a 1:1000 diluted solution of CDP-Star™ (Boehringer Mannheim).
CDP-Star™ serves as a chemiluminescent substrate for alkaline phosphatase. Then the
membrane is transferred onto filter paper (Whatman 3 MM) to remove excess fluid, positioned
between two sheets of transparent overhead foil and exposed to X-ray film
(Chemiluminescent Detection Film, Boehringer Mannheim) for 5-10 min. 3'-5' exonuclease
activity is detected by degradation or shortening of the primer compared with a control
(no polymerase added). As negative and positive controls, DNA polymerases from
Thermus aquaticus (no 3' to 5' exonuclease activity) and from Pyrococcus woesei (exhibiting
3' to 5' exonuclease activity) are included.

Degradation of DNA fragments in the presence or absence of deoxynucleoside
triphosphates: A series of dilutions of Chy polymerase was incubated for 2 hours at 70°C with
1 µg of DNA molecular weight marker III (Boehringer Mannheim) in the presence and
absence of dNTPs, 1 mM each, in 50 µl of the following incubation buffer: 50 mM
Tris-HCl, pH 7.8; 10 mM MgCl2; 7 mM 2-mercaptoethanol, with paraffin overlay. The
DNA fragments were separated on a 1 % agarose gel containing ethidium bromide. In
the absence of dNTPs a smear of DNA fragments or no DNA could be detected, while in
the presence of dNTPs the DNA fragments remained undegraded.

Example 6

Cloning of the Carboxydothermus hydrogenoformans DNA polymerase gene.

Preparation of chromosomal DNA from Carboxydothermus hydrogenoformans.

0.8 g biomass of Carboxydothermus hydrogenoformans was suspended in 20 ml 1 M
KCl and centrifuged. Then the pellet was resuspended in 4.8 ml SET buffer (150 mM
NaCl, 15 mM EDTA, pH 8.0, 60 mM Tris-HCl, pH 8.0, 50 ng/µl RNase A), after which
1 ml 20 % SDS and 50 µl of proteinase K (10 mg/ml) were added. The mixture was kept
at 37°C for 45 minutes. After extraction with phenol and chloroform the DNA was
precipitated with ethanol and dissolved in H2O. Thus about 4.1 mg of DNA were obtained.

Amplification of specific DNA by PCR. 

For amplification of the gene encoding the DNA polymerase of Carboxydothermus
hydrogenoformans by the PCR technique two mixed oligonucleotides (primer 1 and 2)
were designed on the basis of conserved regions of family A DNA polymerases as
published by Braithwaite D.K. and Ito J. (1993) Nucl. Acids Res. 21, 787-802.

SEQ ID No.: 1

Primer 1: 5'-CCN AAY YTN CAR AAY ATH-3'

SEQ ID No.: 2

Primer 2: 5'-YTC RTC RTG NAC YTG-3'
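
Primers 1 and 2 are mixed oligonucleotides written with IUPAC ambiguity codes, so each stands for a pool of distinct sequences. The sketch below counts and enumerates that pool; the code table is the standard IUPAC nucleotide alphabet, and the function names are illustrative.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for symbol in primer.replace(" ", ""):
        count *= len(IUPAC[symbol])
    return count


def expand(primer):
    """List every concrete sequence in the degenerate pool."""
    choices = [IUPAC[symbol] for symbol in primer.replace(" ", "")]
    return ["".join(bases) for bases in product(*choices)]


primer_1 = "CCN AAY YTN CAR AAY ATH"
primer_2 = "YTC RTC RTG NAC YTG"
pool_sizes = (degeneracy(primer_1), degeneracy(primer_2))
```

For the two primers above this gives pools of 768 and 64 sequences, which is why each individual sequence is present at only a small fraction of the 100 pmoles used per primer.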

The PCR amplification was performed in 100 µl buffer containing 750 ng of genomic
DNA from Carboxydothermus hydrogenoformans, 10 mM Tris-HCl, pH 8.8, 2.5 mM
MgCl2, 50 mM KCl, 200 µM dNTPs, 100 pmoles of each primer and 2.5 units of Taq
polymerase (Boehringer Mannheim GmbH, FRG). The target sequence was amplified
by first denaturing at 95°C for 2 min, followed by 30 cycles of 95°C for 0.5 min, 47°C
for 1 min and 72°C for 2 minutes. Thermal cycling was performed in a Perkin Elmer
GeneAmp 9600 thermal cycler. Agarose gel electrophoresis showed that a fragment of
approximately 600 base pairs was amplified specifically. This fragment was ligated into
the pCR™II vector (Invitrogen) and the sequence determined by cycle sequencing. The
amino acid sequence deduced from this nucleotide sequence was very similar to that of
other known DNA polymerases, so that primers 3 and 4 could be designed for inverse
PCR.
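
As a rough check on the cycling program given above (2 min initial denaturation, then 30 cycles of 0.5 min, 1 min and 2 min), the programmed hold time can be added up. This sketch ignores temperature ramping between steps, which depends on the instrument; the function name is illustrative.

```python
def program_minutes(initial_denaturation, cycles, step_minutes):
    """Total programmed hold time in minutes, excluding ramp times."""
    return initial_denaturation + cycles * sum(step_minutes)


# 95 C for 2 min, then 30 x (95 C 0.5 min, 47 C 1 min, 72 C 2 min).
total_minutes = program_minutes(2.0, 30, [0.5, 1.0, 2.0])
```

The programmed holds come to 107 minutes; the actual run is longer by the ramp time of the cycler.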

SEQ ID No.: 3

Primer 3: 5'-GGG CGA AGA CGC TAT ATT CCT GAG C-3'

SEQ ID No.: 4

Primer 4: 5'-GAA GCC TTA ATT CAA TCT GGG AAT AAT C-3'

Inverse PCR was performed essentially as described in Triglia T. et al. (1988) Nucleic
Acids Res. 16, 8186. 5 ng genomic DNA from Carboxydothermus hydrogenoformans
were cleaved by EcoRI according to supplier's specifications (Boehringer Mannheim
GmbH) and treated with an equal volume of phenol/chloroform mixture. The aqueous
phase was removed, the DNA precipitated with ethanol and collected by centrifugation.

For circularization the digested DNA was diluted to a concentration of 50 ng/µl in
ligation buffer (Boehringer Mannheim GmbH, FRG). The ligation reaction was initiated by
the addition of T4 DNA Ligase (Boehringer Mannheim GmbH, FRG) to a concentration
of 0.2 units/µl and the reaction was allowed to proceed for 15 hrs at 15°C. The ligated
DNA was then precipitated with ethanol and collected by centrifugation.

The PCR was performed in 50 µl buffer containing 50 mM Tris-Cl, pH 9.2, 16 mM
(NH4)2SO4, 2.25 mM MgCl2, 2 % (v/v) DMSO, 0.1 % (v/v) Tween™ 20, 700 ng of
circularized DNA obtained as described above, 50 pmoles of each primer, 500 µM
dNTP and 0.75 µl enzyme mix (Expand Long Template PCR System, Boehringer
Mannheim GmbH).

The cycle conditions were as follows:

1 x    denaturation of template for 2 min at 92°C

10 x   denaturation at 92°C for 10 sec

20 x   denaturation at 92°C for 10 sec
       annealing at 64°C for 30 sec
       elongation at 68°C for 2 min
       + cycle elongation of 20 sec for each cycle
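
The cycle elongation in the 20-cycle block lengthens the 68°C step by 20 sec per cycle. The sketch below tabulates the resulting elongation times under one assumption that the text does not settle: the first cycle of the block is taken to run the base 2 min, with 20 sec added on each subsequent cycle. The function name is illustrative.

```python
def elongation_schedule(cycles, base_seconds, increment_seconds):
    """Elongation time per cycle, growing by a fixed increment each cycle."""
    # Assumption: cycle 1 uses the base time; the increment starts at cycle 2.
    return [base_seconds + i * increment_seconds for i in range(cycles)]


schedule = elongation_schedule(cycles=20, base_seconds=120, increment_seconds=20)
total_elongation_seconds = sum(schedule)
```

Under this assumption the elongation step grows from 120 sec in the first cycle to 500 sec in the last, for 6200 sec of elongation in total; if the instrument applies the increment already in the first cycle, every value shifts up by 20 sec.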






Agarose gel electrophoresis revealed a specifically amplified DNA fragment 7,000 base
pairs long. The DNA fragment was ligated into the pCR™II vector (Invitrogen) and
sequenced. From this sequence, primers 5 and 6, coding for the 5'- and 3'-ends,
respectively, of the polymerase region, could be designed. Primer 5 contained an NcoI site
and primer 6 contained a BamHI site.

The PCR was performed under the same conditions as described above (inverse PCR) 
using 750 ng genomic DNA from Carboxydothermus hydrogenoformans as template. 

SEQ ID No.: 5

Primer 5: 5'-CGA ATT CAA TCC ATG GGA AAA GTA GTC CTG GTG GAT-3'

SEQ ID No.: 6

Primer 6: 5'-CGA ATT CAA GGA TCC TTA CTT CGC TTC ATA CCA GTT-3'
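
The cloning sites carried by primers 5 and 6 can be checked directly on the primer sequences. The sketch below locates the NcoI (CCATGG) and BamHI (GGATCC) recognition sequences and reverse-complements the 3' part of primer 6 so that it can be compared with the end of the coding sequence in SEQ ID NO: 7; the helper names are illustrative.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(sequence):
    """Reverse complement of an unambiguous DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(sequence))


primer_5 = "CGAATTCAATCCATGGGAAAAGTAGTCCTGGTGGAT"
primer_6 = "CGAATTCAAGGATCCTTACTTCGCTTCATACCAGTT"

NCOI_SITE = "CCATGG"
BAMHI_SITE = "GGATCC"

ncoi_position = primer_5.find(NCOI_SITE)    # 0-based index, -1 if absent
bamhi_position = primer_6.find(BAMHI_SITE)  # 0-based index, -1 if absent

# The ATG inside the NcoI site starts the coding part of primer 5.
coding_start = primer_5[ncoi_position + 2:]

# Primer 6 anneals to the coding strand, so its 3' part is read in reverse complement.
coding_end = reverse_complement(primer_6[bamhi_position + len(BAMHI_SITE):])
```

Here `coding_start` begins with ATG GGA AAA, the first codons of SEQ ID NO: 7, and `coding_end` reads AAC TGG TAT GAA GCG AAG TAA, its last six codons plus the stop codon.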



Cloning and expression. 



The PCR product was purified by electrophoresis of 20 µl of the PCR mixture on a
0.8 % agarose gel. The 2.496 kb band of the polymerase coding region was purified
from the agarose by phenol extraction. The DNA was then treated with chloroform and
precipitated with ethanol. The pellet was resuspended and digested with NcoI and
BamHI according to supplier's specifications (Boehringer Mannheim GmbH) to give
cohesive ends for directional cloning. The DNA was ligated into the expression vector
pDS56 (Stuber D., Matile H. and Garotta G. (1990) Immunological Methods, Lefkovits,
I. and Pernis, B., eds.) that had also been digested with NcoI and BamHI. The ligated
products were introduced into E. coli strain BL21(DE3) pUBS520 (Brinkmann U. et al.
(1989) Gene 85, 109-114) by transformation. Transformants were grown on L-agar
containing 100 µg/ml ampicillin and 50 µg/ml kanamycin to allow selection of
recombinants. Colonies were picked and grown in L-broth containing 100 µg/ml ampicillin
and 50 µg/ml kanamycin, and plasmid DNA was prepared by alkaline lysis. The
plasmids were screened for insertions by digestion with BamHI. Those recombinants
containing inserts were grown in L-broth containing ampicillin and kanamycin and tested
for the expression of thermophilic DNA polymerase by induction of an exponentially
growing culture with 1 mM IPTG and assaying the heat-treated extracts for DNA
polymerase activity as described above (determination of DNA polymerase activity). A
recombinant expressing the DNA polymerase from Carboxydothermus hydrogenoformans
was obtained. The strain was designated E. coli AR96 (DSM No. 11179) and the
plasmid pAR4.
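
The size of the cloned coding region and the length of the encoded protein are linked by simple codon arithmetic. The sketch below relates the 2496 base pair coding region to the 831 amino acids given in the sequence listing; the function name is illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def residues_encoded(orf_length_bp, last_codon):
    """Number of amino acids encoded by an open reading frame."""
    if orf_length_bp % 3 != 0:
        raise ValueError("an open reading frame must be a whole number of codons")
    codons = orf_length_bp // 3
    # A terminal stop codon is part of the reading frame but encodes no residue.
    return codons - 1 if last_codon in STOP_CODONS else codons


n_residues = residues_encoded(2496, "TAA")
```

2496 base pairs correspond to 832 codons, and with the terminal TAA stop codon removed this leaves 831 residues.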



WO 98/14589    PCT/EP97/05391

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT:
(A) NAME: Boehringer Mannheim GmbH
(B) STREET: Sandhoferstr. 116
(C) CITY: Mannheim
(E) COUNTRY: DE
(F) POSTAL CODE (ZIP): 68305
(G) TELEPHONE: 06217595482
(H) TELEFAX: 06217594457

(ii) TITLE OF INVENTION: Thermostable DNA Polymerase from
Carboxydothermus hydrogenoformans

(iii) NUMBER OF SEQUENCES: 8 

(iv) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CCNAAYYTNC ARAAYATH 18

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "oligonucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

YTCRTCRTGN ACYTG 15

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GGGCGAAGAC GCTATATTCC TGAGC 25

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

GAAGCCTTAA TTCAATCTGG GAATAATC 28

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

CGAATTCAAT CCATGGGAAA AGTAGTCCTG GTGGAT 36

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

CGAATTCAAG GATCCTTACT TCGCTTCATA CCAGTT 36

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2496 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..2496

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

ATG GGA AAA GTA GTC CTG GTG GAT GGA AAT AGT TTA TTA CAT AGA 45
Met Gly Lys Val Val Leu Val Asp Gly Asn Ser Leu Leu His Arg
1 5 10 15

GCG TTT TTT GCC CTT CCG CCC TTA AAA ACT ACT AAA GGA GAG CCT 90
Ala Phe Phe Ala Leu Pro Pro Leu Lys Thr Thr Lys Gly Glu Pro
20 25 30

ACC GGG GCG GTT TAC GGG TTT TTA ACG ATG CTT TTT CGG GTA ATA 135
Thr Gly Ala Val Tyr Gly Phe Leu Thr Met Leu Phe Arg Val Ile
35 40 45

AAA GAT GAA AAA CCC GAA TAT TTA GCG GTA GCT TTT GAT ATT AGC 180
Lys Asp Glu Lys Pro Glu Tyr Leu Ala Val Ala Phe Asp Ile Ser
50 55 60

CGG AAA ACT TTT CGT ACC GAG CAG TTT ACT GCA TAC AAA GGG CAC 225
Arg Lys Thr Phe Arg Thr Glu Gln Phe Thr Ala Tyr Lys Gly His
65 70 75

CGC AAA GAA GCC CCG GAT GAG CTT GTA CCC CAG TTT GCC CTG GTG 270
Arg Lys Glu Ala Pro Asp Glu Leu Val Pro Gln Phe Ala Leu Val
80 85 90

CGG GAA GTA TTA AAG GTT TTA AAT GTT CCC TAT ATT GAA CTT GAC 315
Arg Glu Val Leu Lys Val Leu Asn Val Pro Tyr Ile Glu Leu Asp
95 100 105

GGT TAT GAG GCC GAT GAT ATT ATC GGC CAC CTA TCA AGG GCT TTT 360
Gly Tyr Glu Ala Asp Asp Ile Ile Gly His Leu Ser Arg Ala Phe
110 115 120

GCG GGA CAA GGA CAT GAA GTG GTG ATT TAT ACC GCT GAC CGG GAC 405
Ala Gly Gln Gly His Glu Val Val Ile Tyr Thr Ala Asp Arg Asp
125 130 135

ATG CTG CAA TTG GTA GAT GAA AAA ACG GTG GTA TAC CTT ACC AAA 450
Met Leu Gln Leu Val Asp Glu Lys Thr Val Val Tyr Leu Thr Lys
140 145 150

AAA GGC ATT ACC GAA CTG GTT AAA ATG GAT TTA GCT GCG ATT TTA 495
Lys Gly Ile Thr Glu Leu Val Lys Met Asp Leu Ala Ala Ile Leu
155 160 165

GAA AAC TAC GGC TTA AAG CCT AAA CAG CTT GTG GAT GTT AAA GGA 540
Glu Asn Tyr Gly Leu Lys Pro Lys Gln Leu Val Asp Val Lys Gly
170 175 180

TTA ATG GGA GAT CCC TCG GAC AAC ATA CCC GGG GTT CCC GGG ATT 585
Leu Met Gly Asp Pro Ser Asp Asn Ile Pro Gly Val Pro Gly Ile
185 190 195

GGG GAG AAA ACT GCT TTA GAT TTA ATT AAA ACT TAT GGC TCA GTG 630
Gly Glu Lys Thr Ala Leu Asp Leu Ile Lys Thr Tyr Gly Ser Val
200 205 210

GAA GAA GTT TTG GCC CGT AAA GAT GAG TTA AAA CCT AAA TTA AGA 675
Glu Glu Val Leu Ala Arg Lys Asp Glu Leu Lys Pro Lys Leu Arg
215 220 225

GAA AAG CTT GCC GAA CAC GAA AAT TTA GCA AAA ATA TCG AAA CAA 720
Glu Lys Leu Ala Glu His Glu Asn Leu Ala Lys Ile Ser Lys Gln
230 235 240

TTA GCT ACA ATC CTG CGG GAA ATA CCG TTA GAA ATC TCC CTG GAA 765
Leu Ala Thr Ile Leu Arg Glu Ile Pro Leu Glu Ile Ser Leu Glu
245 250 255

GAT TTA AAA GTT AAA GAA CCT AAT TAT GAA GAA GTT GCT AAA TTA 810
Asp Leu Lys Val Lys Glu Pro Asn Tyr Glu Glu Val Ala Lys Leu
260 265 270

TTT CTT CAC CTT GAG TTT AAA AGC TTT TTA AAA GAA ATA GAA CCA 855
Phe Leu His Leu Glu Phe Lys Ser Phe Leu Lys Glu Ile Glu Pro
275 280 285

AAA ATA AAG AAA GAA TAC CAG GAA GGT AAA GAT TTG GTG CAA GTT 900
Lys Ile Lys Lys Glu Tyr Gln Glu Gly Lys Asp Leu Val Gln Val
290 295 300

GAA ACT GTA GAA ACG GAA GGA CAG ATT GCA GTA GTT TTT AGT GAT 945
Glu Thr Val Glu Thr Glu Gly Gln Ile Ala Val Val Phe Ser Asp
305 310 315

GGA TTT TAT GTT GAT GAC GGG GAA AAA ACA AAG TTT TAC TCT TTA 990
Gly Phe Tyr Val Asp Asp Gly Glu Lys Thr Lys Phe Tyr Ser Leu
320 325 330

GAC CGG CTG AAT GAA ATA GAG GAA ATA TTT AGG AAT AAA AAA ATT 1035
Asp Arg Leu Asn Glu Ile Glu Glu Ile Phe Arg Asn Lys Lys Ile
335 340 345

ATT ACC GAC GAT GCC AAA GGA ATT TAT CAT GTC TGT TTA GAA AAA 1080
Ile Thr Asp Asp Ala Lys Gly Ile Tyr His Val Cys Leu Glu Lys
350 355 360

GGT CTG ACT TTT CCC GAA GTT TGT TTT GAT GCG CGG ATT GCA GCT 1125
Gly Leu Thr Phe Pro Glu Val Cys Phe Asp Ala Arg Ile Ala Ala
365 370 375

TAT GTT TTA AAC CCG GCC GAC CAA AAT CCC GGC CTC AAG GGG CTT 1170
Tyr Val Leu Asn Pro Ala Asp Gln Asn Pro Gly Leu Lys Gly Leu
380 385 390

TAT CTA AAG TAT GAC TTA CCG GTG TAT GAA GAT GTA TCT TTA AAC 1215
Tyr Leu Lys Tyr Asp Leu Pro Val Tyr Glu Asp Val Ser Leu Asn
395 400 405

ATT AGA GGG TTG TTT TAT TTA AAA AAA GAA ATG ATG AGA AAA ATC 1260
Ile Arg Gly Leu Phe Tyr Leu Lys Lys Glu Met Met Arg Lys Ile
410 415 420

TTT GAG CAG GAG CAA GAA AGG TTA TTT TAT GAA ATA GAA CTT CCT 1305
Phe Glu Gln Glu Gln Glu Arg Leu Phe Tyr Glu Ile Glu Leu Pro
425 430 435

TTA ACT CCA GTT CTT GCT CAA ATG GAG CAT ACC GGC ATT CAG GTT 1350
Leu Thr Pro Val Leu Ala Gln Met Glu His Thr Gly Ile Gln Val
440 445 450

GAC CGG GAA GCT TTA AAA GAG ATG TCG TTA GAG CTG GGA GAG CAA 1395
Asp Arg Glu Ala Leu Lys Glu Met Ser Leu Glu Leu Gly Glu Gln
455 460 465

ATT GAA GAG TTA ATC CGG GAA ATT TAT GTG CTG GCG GGG GAA GAG 1440
Ile Glu Glu Leu Ile Arg Glu Ile Tyr Val Leu Ala Gly Glu Glu
470 475 480

TTT AAC TTA AAC TCG CCC AGG CAG CTG GGA GTT ATT CTT TTT GAA 1485
Phe Asn Leu Asn Ser Pro Arg Gln Leu Gly Val Ile Leu Phe Glu
485 490 495

AAA CTT GGG CTG CCG GTA ATT AAA AAG ACC AAA ACG GGC TAC TCT 1530
Lys Leu Gly Leu Pro Val Ile Lys Lys Thr Lys Thr Gly Tyr Ser
500 505 510

ACC GAT GCG GAG GTT TTG GAA GAG CTC TTG CCT TTC CAC GAA ATT 1575
Thr Asp Ala Glu Val Leu Glu Glu Leu Leu Pro Phe His Glu Ile
515 520 525

ATC GGC AAA ATA TTG AAT TAC CGG CAG CTT ATG AAG TTA AAA TCC 1620
Ile Gly Lys Ile Leu Asn Tyr Arg Gln Leu Met Lys Leu Lys Ser
530 535 540

ACT TAT ACT GAC GGC TTA ATG CCT TTA ATA AAT GAG CGT ACC GGT 1665
Thr Tyr Thr Asp Gly Leu Met Pro Leu Ile Asn Glu Arg Thr Gly
545 550 555



AAA CTT CAC ACT ACT TTT AAC CAG ACC GGT ACT TTA ACC GGA CGC 1710
Lys Leu His Thr Thr Phe Asn Gln Thr Gly Thr Leu Thr Gly Arg
560 565 570

CTG GCG TCT TCG GAG CCC AAT CTC CAA AAT ATT CCC ATC CGG TTG 1755
Leu Ala Ser Ser Glu Pro Asn Leu Gln Asn Ile Pro Ile Arg Leu
575 580 585

GAA CTC GGT CGG AAA TTA CGC AAG ATG TTT ATA CCT TCA CCG GGG 1800
Glu Leu Gly Arg Lys Leu Arg Lys Met Phe Ile Pro Ser Pro Gly
590 595 600

TAT GAT TAT ATT GTT TCG GCG GAT TAT TCC CAG ATT GAA TTA AGG 1845
Tyr Asp Tyr Ile Val Ser Ala Asp Tyr Ser Gln Ile Glu Leu Arg
605 610 615

CTT CTT GCC CAT TTT TCC GAA GAG CCC AAG CTT ATT GAA GCT TAC 1890
Leu Leu Ala His Phe Ser Glu Glu Pro Lys Leu Ile Glu Ala Tyr
620 625 630

CAA AAA GGG GAG GAT ATT CAC CGG AAA ACG GCC TCC GAG GTG TTC 1935
Gln Lys Gly Glu Asp Ile His Arg Lys Thr Ala Ser Glu Val Phe
635 640 645

GGT GTA TCT TTG GAA GAA GTT ACT CCC GAG ATG CGC GCT CAT GCC 1980
Gly Val Ser Leu Glu Glu Val Thr Pro Glu Met Arg Ala His Ala
650 655 660

AAG TCG GTG AAC TTC GGC ATT GTT TAT GGC ATT AGT GAT TTT GGT 2025
Lys Ser Val Asn Phe Gly Ile Val Tyr Gly Ile Ser Asp Phe Gly
665 670 675

TTA GGC AGA GAC TTA AAG ATT CCC CGG GAG GTT GCC GGT AAG TAC 2070
Leu Gly Arg Asp Leu Lys Ile Pro Arg Glu Val Ala Gly Lys Tyr
680 685 690

ATT AAA AAT TAT TTT GCC AAC TAT CCC AAA GTG CGG GAG TAT CTC 2115
Ile Lys Asn Tyr Phe Ala Asn Tyr Pro Lys Val Arg Glu Tyr Leu
695 700 705

GAT GAA CTT GTC CGT ACG GCA AGA GAA AAG GGA TAT GTG ACC ACT 2160
Asp Glu Leu Val Arg Thr Ala Arg Glu Lys Gly Tyr Val Thr Thr
710 715 720

TTA TTT GGG CGA AGA CGC TAT ATT CCT GAG CTA TCT TCA AAA AAC 2205
Leu Phe Gly Arg Arg Arg Tyr Ile Pro Glu Leu Ser Ser Lys Asn
725 730 735

CGC ACG GTT CAG GGT TTT GGC GAA AGG ACG GCC ATG AAT ACT CCC 2250
Arg Thr Val Gln Gly Phe Gly Glu Arg Thr Ala Met Asn Thr Pro
740 745 750

CTT CAG GGC TCG GCT GCC GAT ATT ATT AAG CTT GCA ATG ATT AAT 2295
Leu Gln Gly Ser Ala Ala Asp Ile Ile Lys Leu Ala Met Ile Asn
755 760 765

GTA GAA AAA GAA CTT AAA GCC CGT AAG CTT AAG TCC CGG CTC CTT 2340
Val Glu Lys Glu Leu Lys Ala Arg Lys Leu Lys Ser Arg Leu Leu
770 775 780

CTT TCG GTG CAC GAT GAG TTA GTT TTA GAA GTG CCG GCG GAA GAG 2385
Leu Ser Val His Asp Glu Leu Val Leu Glu Val Pro Ala Glu Glu
785 790 795

CTG GAA GAG GTA AAA GCG CTG GTA AAA GGG GTT ATG GAG TCG GTG 2430
Leu Glu Glu Val Lys Ala Leu Val Lys Gly Val Met Glu Ser Val
800 805 810

GTT GAA CTG AAA GTG CCT TTA ATC GCT GAA GTT GGT GCA GGC AAA 2475
Val Glu Leu Lys Val Pro Leu Ile Ala Glu Val Gly Ala Gly Lys
815 820 825

AAC TGG TAT GAA GCG AAG TAA 2496
Asn Trp Tyr Glu Ala Lys *
830



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 831 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Gly Lys Val Val Leu Val Asp Gly Asn Ser Leu Leu His Arg
1 5 10 15

Ala Phe Phe Ala Leu Pro Pro Leu Lys Thr Thr Lys Gly Glu Pro
20 25 30

Thr Gly Ala Val Tyr Gly Phe Leu Thr Met Leu Phe Arg Val Ile
35 40 45

Lys Asp Glu Lys Pro Glu Tyr Leu Ala Val Ala Phe Asp Ile Ser
50 55 60

Arg Lys Thr Phe Arg Thr Glu Gln Phe Thr Ala Tyr Lys Gly His
65 70 75

Arg Lys Glu Ala Pro Asp Glu Leu Val Pro Gln Phe Ala Leu Val
80 85 90

Arg Glu Val Leu Lys Val Leu Asn Val Pro Tyr Ile Glu Leu Asp
95 100 105



Gly Tyr Glu Ala Asp Asp Ile Ile Gly His Leu Ser Arg Ala Phe
110 115 120

Ala Gly Gln Gly His Glu Val Val Ile Tyr Thr Ala Asp Arg Asp
125 130 135

Met Leu Gln Leu Val Asp Glu Lys Thr Val Val Tyr Leu Thr Lys
140 145 150

Lys Gly Ile Thr Glu Leu Val Lys Met Asp Leu Ala Ala Ile Leu
155 160 165

Glu Asn Tyr Gly Leu Lys Pro Lys Gln Leu Val Asp Val Lys Gly
170 175 180

Leu Met Gly Asp Pro Ser Asp Asn Ile Pro Gly Val Pro Gly Ile
185 190 195

Gly Glu Lys Thr Ala Leu Asp Leu Ile Lys Thr Tyr Gly Ser Val
200 205 210

Glu Glu Val Leu Ala Arg Lys Asp Glu Leu Lys Pro Lys Leu Arg
215 220 225

Glu Lys Leu Ala Glu His Glu Asn Leu Ala Lys Ile Ser Lys Gln
230 235 240

Leu Ala Thr Ile Leu Arg Glu Ile Pro Leu Glu Ile Ser Leu Glu
245 250 255

Asp Leu Lys Val Lys Glu Pro Asn Tyr Glu Glu Val Ala Lys Leu
260 265 270

Phe Leu His Leu Glu Phe Lys Ser Phe Leu Lys Glu Ile Glu Pro
275 280 285

Lys Ile Lys Lys Glu Tyr Gln Glu Gly Lys Asp Leu Val Gln Val
290 295 300

Glu Thr Val Glu Thr Glu Gly Gln Ile Ala Val Val Phe Ser Asp
305 310 315

Gly Phe Tyr Val Asp Asp Gly Glu Lys Thr Lys Phe Tyr Ser Leu
320 325 330

Asp Arg Leu Asn Glu Ile Glu Glu Ile Phe Arg Asn Lys Lys Ile
335 340 345

Ile Thr Asp Asp Ala Lys Gly Ile Tyr His Val Cys Leu Glu Lys
350 355 360

Gly Leu Thr Phe Pro Glu Val Cys Phe Asp Ala Arg Ile Ala Ala
365 370 375

Tyr Val Leu Asn Pro Ala Asp Gln Asn Pro Gly Leu Lys Gly Leu
380 385 390

Tyr Leu Lys Tyr Asp Leu Pro Val Tyr Glu Asp Val Ser Leu Asn
395 400 405

Ile Arg Gly Leu Phe Tyr Leu Lys Lys Glu Met Met Arg Lys Ile
410 415 420

Phe Glu Gln Glu Gln Glu Arg Leu Phe Tyr Glu Ile Glu Leu Pro
425 430 435

Leu Thr Pro Val Leu Ala Gln Met Glu His Thr Gly Ile Gln Val
440 445 450

Asp Arg Glu Ala Leu Lys Glu Met Ser Leu Glu Leu Gly Glu Gln
455 460 465

Ile Glu Glu Leu Ile Arg Glu Ile Tyr Val Leu Ala Gly Glu Glu
470 475 480

Phe Asn Leu Asn Ser Pro Arg Gln Leu Gly Val Ile Leu Phe Glu
485 490 495

Lys Leu Gly Leu Pro Val Ile Lys Lys Thr Lys Thr Gly Tyr Ser
500 505 510

Thr Asp Ala Glu Val Leu Glu Glu Leu Leu Pro Phe His Glu Ile
515 520 525

Ile Gly Lys Ile Leu Asn Tyr Arg Gln Leu Met Lys Leu Lys Ser
530 535 540

Thr Tyr Thr Asp Gly Leu Met Pro Leu Ile Asn Glu Arg Thr Gly
545 550 555

Lys Leu His Thr Thr Phe Asn Gln Thr Gly Thr Leu Thr Gly Arg
560 565 570

Leu Ala Ser Ser Glu Pro Asn Leu Gln Asn Ile Pro Ile Arg Leu
575 580 585

Glu Leu Gly Arg Lys Leu Arg Lys Met Phe Ile Pro Ser Pro Gly
590 595 600

Tyr Asp Tyr Ile Val Ser Ala Asp Tyr Ser Gln Ile Glu Leu Arg
605 610 615

Leu Leu Ala His Phe Ser Glu Glu Pro Lys Leu Ile Glu Ala Tyr
620 625 630

Gln Lys Gly Glu Asp Ile His Arg Lys Thr Ala Ser Glu Val Phe
635 640 645

Gly Val Ser Leu Glu Glu Val Thr Pro Glu Met Arg Ala His Ala
650 655 660

Lys Ser Val Asn Phe Gly Ile Val Tyr Gly Ile Ser Asp Phe Gly
665 670 675

Leu Gly Arg Asp Leu Lys Ile Pro Arg Glu Val Ala Gly Lys Tyr
680 685 690

Ile Lys Asn Tyr Phe Ala Asn Tyr Pro Lys Val Arg Glu Tyr Leu
695 700 705

Asp Glu Leu Val Arg Thr Ala Arg Glu Lys Gly Tyr Val Thr Thr
710 715 720

Leu Phe Gly Arg Arg Arg Tyr Ile Pro Glu Leu Ser Ser Lys Asn
725 730 735

Arg Thr Val Gln Gly Phe Gly Glu Arg Thr Ala Met Asn Thr Pro
740 745 750

Leu Gln Gly Ser Ala Ala Asp Ile Ile Lys Leu Ala Met Ile Asn
755 760 765

Val Glu Lys Glu Leu Lys Ala Arg Lys Leu Lys Ser Arg Leu Leu
770 775 780

Leu Ser Val His Asp Glu Leu Val Leu Glu Val Pro Ala Glu Glu
785 790 795

Leu Glu Glu Val Lys Ala Leu Val Lys Gly Val Met Glu Ser Val
800 805 810

Val Glu Leu Lys Val Pro Leu Ile Ala Glu Val Gly Ala Gly Lys
815 820 825

Asn Trp Tyr Glu Ala Lys *
830

INTERNATIONAL FORM



Boehringer Mannheim GmbH 




Sandhofer Str. 116 




6 83 05 Mannheim 


RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT 
issued pursuant to Rule 7.1 by the 

INTERNATIONAL DEPOSITARY AUTHORITY 
identified at the bottom of this page 


!. IDENTIFICATION OF THE MICROORGANISM 


Identification reference given by the DEPOSITOR: 

DSM 8979 


Accession number given by the 
INTERNATIONAL DEPOSITARY AUTHORITY: 

DSM 8 979 


U. SCIENTIFIC DESCRIPTION AND/OR PROPOSED TAXONOM3C DESIGNATION 


Hie microorganism identified under I. above was accompanied bv: 




( ) a scientific description 

(X ) a proposed uxonomic designation 




(Mark with a cross where applicable). 




III. RECEIPT AND ACCEPTANCE 


This International Deoositary Authority accepts the microorganism identified under I. above, which was received by 
iton 1994-02-10 (Date of the original deposit)'. 


IV. RECEIPT OF REQUEST FOR CONVERSION


The microorganism identified under I above was received by this International Depositary Authority on
(date of original deposit) and a request to convert the original deposit to a deposit under the Budapest Treaty was
received by it on (date of receipt of request for conversion).


V. INTERNATIONAL DEPOSITARY AUTHORITY 


Name: DSM-DEUTSCHE SAMMLUNG VON
MIKROORGANISMEN UND ZELLKULTUREN GmbH

Address: Mascheroder Weg 1b
D-38124 Braunschweig


Signature(s) of person(s) having the power
to represent the International Depositary Authority
or of authorized official(s):

Date: 1994-09-21



¹ Where Rule 6.4(d) applies, such date is the date on which the status of international depositary authority was acquired.
Form DSM-BP/4 (sole page) 07/94
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Boehringer Mannheim GmbH 




Sandhofer Str. 116 




68305 Mannheim 


VIABILITY STATEMENT 
issued pursuant to Rule 10.2 by the 
INTERNATIONAL DEPOSITARY AUTHORITY 
identified at the bottom of this page


I. DEPOSITOR

Name: Boehringer Mannheim GmbH
Address: Sandhofer Str. 116
68305 Mannheim


II. IDENTIFICATION OF THE MICROORGANISM

Accession number given by the
INTERNATIONAL DEPOSITARY AUTHORITY:

DSM 8979

Date of the deposit or the transfer¹:
1994-02-10


III. VIABILITY STATEMENT 


The viability of the microorganism identified under II above was tested on 1994-02-16².
On that date, the said microorganism was


(X)³ viable

( )³ no longer viable




IV. CONDITIONS UNDER WHICH THE VIABILITY TEST HAS BEEN PERFORMED⁴




V. INTERNATIONAL DEPOSITARY AUTHORITY 


Name: DSM-DEUTSCHE SAMMLUNG VON
MIKROORGANISMEN UND ZELLKULTUREN GmbH

Address: Mascheroder Weg 1b
D-38124 Braunschweig


Signature(s) of person(s) having the power
to represent the International Depositary Authority
or of authorized official(s):

Date: 1994-09-21



¹ Indicate the date of original deposit or, where a new deposit or a transfer has been made, the most recent relevant
date (date of the new deposit or date of the transfer).
² In the cases referred to in Rule 10.2(a)(ii) and (iii), refer to the most recent viability test.
³ Mark with a cross the applicable box.
⁴ Fill in if the information has been requested and if the results of the test were negative.



Form DSM-BP/9 (sole page) 07/94
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Claims 

1. A purified thermostable DNA polymerase obtainable from Carboxydothermus
hydrogenoformans that catalyses the template-directed polymerisation of DNA,
possesses 3'-5' exonuclease activity and has reverse transcriptase activity in the
presence of magnesium ions and in the substantial absence of manganese ions.

2. The polymerase according to claim 1, wherein said polymerase exhibits a reverse
transcriptase activity which is Mn2+ dependent.

3. The polymerase as claimed in any of claims 1-2, wherein the magnesium
ion-dependent reverse transcriptase activity of said polymerase is higher than the DNA
polymerase activity of said polymerase.

4. The polymerase as claimed in any of claims 1-3, wherein the magnesium
ion-dependent reverse transcriptase activity of said polymerase is higher than the
manganese-dependent reverse transcriptase activity of said polymerase.

5. The polymerase as claimed in any of claims 1-4, wherein said polymerase has an
apparent molecular weight between about 100 and 105 kDa as determined by SDS
polyacrylamide electrophoresis.

6. The polymerase as claimed in any of claims 1-5, wherein said polymerase is 
obtainable from E. coli BL21 (DE3) pUBS520, the strain being designated AR96. 


7. An isolated DNA sequence coding for the polymerase as claimed in any one of 
claims 1-6 obtainable from Carboxydothermus hydrogenoformans. 



8. A recombinant DNA sequence capable of encoding polymerase activity of the
microorganism Carboxydothermus hydrogenoformans.
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9. An isolated DNA sequence represented by the formula shown in SEQ ID No. 7. 

10. A vector containing the isolated DNA sequence as claimed in any of claims 7-9. 

11. The vector according to claim 10, wherein such vector is plasmid pDS56 carrying
the Carboxydothermus hydrogenoformans DNA polymerase gene and is then
designated pAR4.

12. The vector according to claims 10 and 11 providing some or all of the following
features:

(1) promoters or sites of initiation of transcription

(2) operators which could be used to turn gene expression on or off 

(3) ribosome binding sites for improved translation 

(4) transcription or translation termination sites 



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13. A microbial host transformed with the vector of claims 10-12. 

14. A microbial host according to claim 13 wherein said transformant is from E. coli 
BL21 (DE3) pUBS520, the strain being designated AR96. 



15. A process for the preparation of DNA polymerase according to any of the claims
1-6 comprising the steps:

(a) culturing the natural strain Carboxydothermus hydrogenoformans

(b) suspending the cells of the natural strain in buffer

(c) disrupting the cells

(d) purifying the DNA polymerase by chromatographic steps including
the use of one or more Sepharose columns.



16. A process for the preparation of DNA polymerase according to any one of claims
1-6 comprising growing a recombinant E. coli strain transformed with a vector
according to claims 10-12 and purifying and isolating the DNA polymerase.
17. A process of amplifying DNA, characterized in that a thermostable DNA
polymerase as claimed in any one of claims 1-6 is used.

18. A process for second cDNA cloning and DNA sequencing, characterized in that a
thermostable DNA polymerase as claimed in any one of claims 1-6 is used.

19. A process for DNA labeling, characterized in that a thermostable DNA
polymerase as claimed in any one of claims 1-6 is used.

20. A process for reverse transcription, characterized in that a thermostable DNA
polymerase as claimed in any one of claims 1-6 is used.



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Figure 1: 



[Graph not reproduced; axis label: RLU]
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Figure 2: 

Detection of reverse transcriptase activity of DNA polymerase from 
Carboxydothermus hydrogenoformans and reference polymerases 
in situ. 



1 2 3 4 5 6 7 8 9 10 




1: E. coli pol I (20U) + Klenow fragment (10U)
2: Cell extract from C. hydrogenoformans (2 µl)
3: Chy polymerase, rec. (7U)
4: Taq polymerase (20U)
5: Cell extract from C. hydrogenoformans (2 µl)
6: Chy polymerase, rec. (14U)
7: E. coli pol I (10U) + Klenow fragment (5U)
8: Cell extract from C. hydrogenoformans (4 µl)
9: Chy polymerase, rec. (70U)
10: Taq polymerase (20U)
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Figure 3: 
[Gel image not reproduced. Lane labels, in order: Primer, no Pol.; Template/Primer, no Pol.; followed by four series, each consisting of: Reaction mixture, no dNTP's; 10 nM dNTP's; 100 nM dNTP's; 1 µM dNTP's; 10 µM dNTP's; 100 µM dNTP's; 600 µM dNTP's.]
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Figure 5: 

SEQ ID No.: 7
SEQ ID No.: 8

ATG GGA AAA GTA GTC CTG GTG GAT GGA AAT AGT TTA TTA CAT AGA GCG   48
M   G   K   V   V   L   V   D   G   N   S   L   L   H   R   A     16

TTT TTT GCC CTT CCG CCC TTA AAA ACT ACT AAA GGA GAG CCT ACC GGG   96
F   F   A   L   P   P   L   K   T   T   K   G   E   P   T   G     32

GCG GTT TAC GGG TTT TTA ACG ATG CTT TTT CGG GTA ATA AAA GAT GAA   144
A   V   Y   G   F   L   T   M   L   F   R   V   I   K   D   E     48

AAA CCC GAA TAT TTA GCG GTA GCT TTT GAT ATT AGC CGG AAA ACT TTT   192
K   P   E   Y   L   A   V   A   F   D   I   S   R   K   T   F     64

CGT ACC GAG CAG TTT ACT GCA TAC AAA GGG CAC CGC AAA GAA GCC CCG   240
R   T   E   Q   F   T   A   Y   K   G   H   R   K   E   A   P     80

GAT GAG CTT GTA CCC CAG TTT GCC CTG GTG CGG GAA GTA TTA AAG GTT   288
D   E   L   V   P   Q   F   A   L   V   R   E   V   L   K   V     96

TTA AAT GTT CCC TAT ATT GAA CTT GAC GGT TAT GAG GCC GAT GAT ATT   336
L   N   V   P   Y   I   E   L   D   G   Y   E   A   D   D   I     112

ATC GGC CAC CTA TCA AGG GCT TTT GCG GGA CAA GGA CAT GAA GTG GTG   384
I   G   H   L   S   R   A   F   A   G   Q   G   H   E   V   V     128

ATT TAT ACC GCT GAC CGG GAC ATG CTG CAA TTG GTA GAT GAA AAA ACG   432
I   Y   T   A   D   R   D   M   L   Q   L   V   D   E   K   T     144

GTG GTA TAC CTT ACC AAA AAA GGC ATT ACC GAA CTG GTT AAA ATG GAT   480
V   V   Y   L   T   K   K   G   I   T   E   L   V   K   M   D     160

TTA GCT GCG ATT TTA GAA AAC TAC GGC TTA AAG CCT AAA CAG CTT GTG   528
L   A   A   I   L   E   N   Y   G   L   K   P   K   Q   L   V     176

GAT GTT AAA GGA TTA ATG GGA GAT CCC TCG GAC AAC ATA CCC GGG GTT   576
D   V   K   G   L   M   G   D   P   S   D   N   I   P   G   V     192

CCC GGG ATT GGG GAG AAA ACT GCT TTA GAT TTA ATT AAA ACT TAT GGC   624
P   G   I   G   E   K   T   A   L   D   L   I   K   T   Y   G     208

TCA GTG GAA GAA GTT TTG GCC CGT AAA GAT GAG TTA AAA CCT AAA TTA   672
S   V   E   E   V   L   A   R   K   D   E   L   K   P   K   L     224

AGA GAA AAG CTT GCC GAA CAC GAA AAT TTA GCA AAA ATA TCG AAA CAA   720
R   E   K   L   A   E   H   E   N   L   A   K   I   S   K   Q     240

TTA GCT ACA ATC CTG CGG GAA ATA CCG TTA GAA ATC TCC CTG GAA GAT   768
L   A   T   I   L   R   E   I   P   L   E   I   S   L   E   D     256

TTA AAA GTT AAA GAA CCT AAT TAT GAA GAA GTT GCT AAA TTA TTT CTT   816
L   K   V   K   E   P   N   Y   E   E   V   A   K   L   F   L     272

CAC CTT GAG TTT AAA AGC TTT TTA AAA GAA ATA GAA CCA AAA ATA AAG   864
H   L   E   F   K   S   F   L   K   E   I   E   P   K   I   K     288

AAA GAA TAC CAG GAA GGT AAA GAT TTG GTG CAA GTT GAA ACT GTA GAA   912
K   E   Y   Q   E   G   K   D   L   V   Q   V   E   T   V   E     304

ACG GAA GGA CAG ATT GCA GTA GTT TTT AGT GAT GGA TTT TAT GTT GAT   960
T   E   G   Q   I   A   V   V   F   S   D   G   F   Y   V   D     320

GAC GGG GAA AAA ACA AAG TTT TAC TCT TTA GAC CGG CTG AAT GAA ATA   1008
D   G   E   K   T   K   F   Y   S   L   D   R   L   N   E   I     336

GAG GAA ATA TTT AGG AAT AAA AAA ATT ATT ACC GAC GAT GCC AAA GGA   1056
E   E   I   F   R   N   K   K   I   I   T   D   D   A   K   G     352

ATT TAT CAT GTC TGT TTA GAA AAA GGT CTG ACT TTT CCC GAA GTT TGT   1104
I   Y   H   V   C   L   E   K   G   L   T   F   P   E   V   C     368

TTT GAT GCG CGG ATT GCA GCT TAT GTT TTA AAC CCG GCC GAC CAA AAT   1152
F   D   A   R   I   A   A   Y   V   L   N   P   A   D   Q   N     384

CCC GGC CTC AAG GGG CTT TAT CTA AAG TAT GAC TTA CCG GTG TAT GAA   1200
P   G   L   K   G   L   Y   L   K   Y   D   L   P   V   Y   E     400

GAT GTA TCT TTA AAC ATT AGA GGG TTG TTT TAT TTA AAA AAA GAA ATG   1248
D   V   S   L   N   I   R   G   L   F   Y   L   K   K   E   M     416

ATG AGA AAA ATC TTT GAG CAG GAG CAA GAA AGG TTA TTT TAT GAA ATA   1296
M   R   K   I   F   E   Q   E   Q   E   R   L   F   Y   E   I     432

GAA CTT CCT TTA ACT CCA GTT CTT GCT CAA ATG GAG CAT ACC GGC ATT   1344
E   L   P   L   T   P   V   L   A   Q   M   E   H   T   G   I     448

CAG GTT GAC CGG GAA GCT TTA AAA GAG ATG TCG TTA GAG CTG GGA GAG   1392
Q   V   D   R   E   A   L   K   E   M   S   L   E   L   G   E     464

CAA ATT GAA GAG TTA ATC CGG GAA ATT TAT GTG CTG GCG GGG GAA GAG   1440
Q   I   E   E   L   I   R   E   I   Y   V   L   A   G   E   E     480

TTT AAC TTA AAC TCG CCC AGG CAG CTG GGA GTT ATT CTT TTT GAA AAA   1488
F   N   L   N   S   P   R   Q   L   G   V   I   L   F   E   K     496

CTT GGG CTG CCG GTA ATT AAA AAG ACC AAA ACG GGC TAC TCT ACC GAT   1536
L   G   L   P   V   I   K   K   T   K   T   G   Y   S   T   D     512

GCG GAG GTT TTG GAA GAG CTC TTG CCT TTC CAC GAA ATT ATC GGC AAA   1584
A   E   V   L   E   E   L   L   P   F   H   E   I   I   G   K     528

ATA TTG AAT TAC CGG CAG CTT ATG AAG TTA AAA TCC ACT TAT ACT GAC   1632
I   L   N   Y   R   Q   L   M   K   L   K   S   T   Y   T   D     544

GGC TTA ATG CCT TTA ATA AAT GAG CGT ACC GGT AAA CTT CAC ACT ACT   1680
G   L   M   P   L   I   N   E   R   T   G   K   L   H   T   T     560

TTT AAC CAG ACC GGT ACT TTA ACC GGA CGC CTG GCG TCT TCG GAG CCC   1728
F   N   Q   T   G   T   L   T   G   R   L   A   S   S   E   P     576

AAT CTC CAA AAT ATT CCC ATC CGG TTG GAA CTC GGT CGG AAA TTA CGC   1776
N   L   Q   N   I   P   I   R   L   E   L   G   R   K   L   R     592

AAG ATG TTT ATA CCT TCA CCG GGG TAT GAT TAT ATT GTT TCG GCG GAT   1824
K   M   F   I   P   S   P   G   Y   D   Y   I   V   S   A   D     608

TAT TCC CAG ATT GAA TTA AGG CTT CTT GCC CAT TTT TCC GAA GAG CCC   1872
Y   S   Q   I   E   L   R   L   L   A   H   F   S   E   E   P     624

AAG CTT ATT GAA GCT TAC CAA AAA GGG GAG GAT ATT CAC CGG AAA ACG   1920
K   L   I   E   A   Y   Q   K   G   E   D   I   H   R   K   T     640

GCC TCC GAG GTG TTC GGT GTA TCT TTG GAA GAA GTT ACT CCC GAG ATG   1968
A   S   E   V   F   G   V   S   L   E   E   V   T   P   E   M     656

CGC GCT CAT GCC AAG TCG GTG AAC TTC GGC ATT GTT TAT GGC ATT AGT   2016
R   A   H   A   K   S   V   N   F   G   I   V   Y   G   I   S     672

GAT TTT GGT TTA GGC AGA GAC TTA AAG ATT CCC CGG GAG GTT GCC GGT   2064
D   F   G   L   G   R   D   L   K   I   P   R   E   V   A   G     688

AAG TAC ATT AAA AAT TAT TTT GCC AAC TAT CCC AAA GTG CGG GAG TAT   2112
K   Y   I   K   N   Y   F   A   N   Y   P   K   V   R   E   Y     704

CTC GAT GAA CTT GTC CGT ACG GCA AGA GAA AAG GGA TAT GTG ACC ACT   2160
L   D   E   L   V   R   T   A   R   E   K   G   Y   V   T   T     720

TTA TTT GGG CGA AGA CGC TAT ATT CCT GAG CTA TCT TCA AAA AAC CGC   2208
L   F   G   R   R   R   Y   I   P   E   L   S   S   K   N   R     736

ACG GTT CAG GGT TTT GGC GAA AGG ACG GCC ATG AAT ACT CCC CTT CAG   2256
T   V   Q   G   F   G   E   R   T   A   M   N   T   P   L   Q     752

GGC TCG GCT GCC GAT ATT ATT AAG CTT GCA ATG ATT AAT GTA GAA AAA   2304
G   S   A   A   D   I   I   K   L   A   M   I   N   V   E   K     768

GAA CTT AAA GCC CGT AAG CTT AAG TCC CGG CTC CTT CTT TCG GTG CAC   2352
E   L   K   A   R   K   L   K   S   R   L   L   L   S   V   H     784

GAT GAG TTA GTT ?TA GAA GTG CCG GCG GAA GAG CTG GAA GAG GTA AAA   2400
D   E   L   V   L   E   V   P   A   E   E   L   E   E   V   K     800

GCG CTG GTA AAA GGG GTT ATG GAG TCG GTG GTT GAA CTG AAA GTG CCT   2448
A   L   V   K   G   V   M   E   S   V   V   E   L   K   V   P     816

TTA ATC GCT GAA GTT GGT GCA GGC AAA AAC TGG TAT GAA GCG AAG TAA   2496
L   I   A   E   V   G   A   G   K   N   W   Y   E   A   K   *     831
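
The pairing of codons and amino acids in Figure 5 can be checked mechanically. The following is a minimal Python sketch, not part of the original disclosure, which assumes the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative. It translates the first and last rows of the listing, and reports any codon containing an unreadable base as `X`.

```python
# Standard genetic code, packed in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) codon by codon; '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of three")
    # Codons that are not in the table (e.g. an unreadable base) become 'X'.
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3))


# First and last rows of the nucleotide sequence as printed in Figure 5.
first_row = "ATG GGA AAA GTA GTC CTG GTG GAT GGA AAT AGT TTA TTA CAT AGA GCG"
last_row = "TTA ATC GCT GAA GTT GGT GCA GGC AAA AAC TGG TAT GAA GCG AAG TAA"

print(translate(first_row))  # MGKVVLVDGNSLLHRA
print(translate(last_row))   # LIAEVGAGKNWYEAK*
```

Applied row by row, the same function gives a quick way to spot transcription errors in either the nucleotide or the amino acid line.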
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